|
In statistics, deviance is a quality of fit statistic for a model that is often used for statistical hypothesis testing. It is a generalization of the idea of using the sum of squares of residuals in ordinary least squares to cases where model-fitting is achieved by maximum likelihood. ==Definition== The deviance for a model ''M''0, based on a dataset ''y'', is defined as:〔McCullagh and Nelder (1989)〕 : Here denotes the fitted values of the parameters in the model ''M''0, while denotes the fitted parameters for the "full model" (or "saturated model"): both sets of fitted values are implicitly functions of the observations ''y''. Here the full model is a model with a parameter for every observation so that the data are fitted exactly. This expression is simply −2 times the log-likelihood ratio of the reduced model compared to the full model. The deviance is used to compare two models – in particular in the case of generalized linear models where it has a similar role to residual variance from ANOVA in linear models (RSS). Suppose in the framework of the GLM, we have two nested models, ''M1'' and ''M2''. In particular, suppose that ''M1'' contains the parameters in ''M2'', and ''k'' additional parameters. Then, under the null hypothesis that ''M2'' is the true model, the difference between the deviances for the two models follows an approximate chi-squared distribution with ''k''-degrees of freedom.〔 Some usage of the term "deviance" can be confusing. According to Collett:〔Collett (2003)〕 : "the quantity is sometimes referred to as a ''deviance''. This is () inappropriate, since unlike the deviance used in the context of generalized linear modelling, does not measure deviation from a model that is a perfect fit to the data." However, since the principal use is in the form of the difference of the deviances of two models, this confusion in definition is unimportant. 抄文引用元・出典: フリー百科事典『 ウィキペディア(Wikipedia)』 ■ウィキペディアで「Deviance (statistics)」の詳細全文を読む スポンサード リンク
|